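Fasteners play a critical role in securing the various parts of machinery. Deformations on fastener surfaces, such as dents, cracks, and scratches, are caused by material properties and by incorrect handling of equipment during production. Quality control is therefore required to ensure safe and reliable operation. Existing defect inspection relies on manual examination, which consumes significant time, money, and other resources, and its accuracy cannot be guaranteed because of human error. Automatic defect detection systems have proven more effective than manual inspection for defect analysis. However, computational techniques such as convolutional neural networks (CNNs) and other deep learning-based approaches are still evolving methods. The full potential of a CNN can be realized by carefully selecting its design parameter values. In this study, Taguchi-based design of experiments and analysis is used in an attempt to develop a robust automatic system. The dataset used to train the system was created manually for M14-size nuts with two labeled classes: defective and non-defective. The dataset contains 264 images in total. The proposed sequential CNN achieves a validation accuracy of 96.3% and a validation loss of 0.277 at a learning rate of 0.001.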
translated by 谷歌翻译
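The Taguchi analysis mentioned in the abstract above typically ranks parameter levels by a signal-to-noise (S/N) ratio. The sketch below shows the standard "larger-the-better" form, S/N = -10 log10(mean(1/y^2)), which would suit a response such as validation accuracy. The abstract does not say which S/N criterion or which factor levels were used, so the choice of criterion, the function name, and all accuracy values here are assumptions for illustration only.

```python
import math


def sn_larger_is_better(values):
    """Taguchi S/N ratio in dB for a response that should be maximized."""
    if not values or any(v <= 0 for v in values):
        raise ValueError("values must be non-empty and strictly positive")
    mean_inv_sq = sum(1.0 / (v * v) for v in values) / len(values)
    return -10.0 * math.log10(mean_inv_sq)


# Hypothetical repeated validation accuracies per learning-rate level
# (made-up numbers, not results from the paper).
runs = {
    0.001: [0.95, 0.96, 0.96],
    0.01: [0.90, 0.85, 0.93],
}

ratios = {lr: sn_larger_is_better(acc) for lr, acc in runs.items()}
best_lr = max(ratios, key=ratios.get)  # level with the highest S/N ratio
print(best_lr, round(ratios[best_lr], 3))
```

A higher S/N ratio rewards levels whose runs are both high and consistent, which is why a level with one poor run is penalized more than its mean alone would suggest.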
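Modern life is driven by electronic devices connected to the internet. The emerging research field of the Internet of Things (IoT) has become popular as the number of connected devices has steadily increased, now exceeding 50 billion. Because many of these devices are used to perform computer vision (CV) tasks, it is essential to understand their power consumption relative to their performance. We report the power consumption profile and analysis of the NVIDIA Jetson Nano board while performing object classification. The authors present an extensive analysis of power consumption per frame and output in frames per second (FPS) using YOLOv5 models. The results show that YOLOv5n outperforms the other YOLOv5 variants in terms of throughput (12.34 FPS) and low power consumption (0.154 mWh/frame).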
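The per-frame energy figure in the abstract above follows from average power and throughput: energy per frame is power divided by frame rate, and 1 mWh equals 3.6 J. The sketch below shows that arithmetic. The function name is hypothetical, and the 6.84 W input is an illustrative value chosen only so the result lands near the reported figure; it is not a measurement from the paper.

```python
def energy_per_frame_mwh(avg_power_w, fps):
    """Convert average power (W) and throughput (frames/s) to mWh per frame."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    joules_per_frame = avg_power_w / fps  # W / (frames/s) = J per frame
    return joules_per_frame / 3.6         # 1 mWh = 3.6 J


# Illustrative input only: 6.84 W is an assumed average power draw.
print(round(energy_per_frame_mwh(6.84, 12.34), 3))  # prints 0.154
```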
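Because gestures are a universal, non-verbal, natural means of effective communication between humans, gesture recognition technology has developed steadily over the past few decades. Research articles on gesture recognition have proposed many different strategies for building an effective system that conveys non-verbal natural communication to computers, using both physical sensors and computer vision. Highly accurate real-time systems, on the other hand, have only recently begun to occupy the research field, each adopting a range of methods because of past limitations such as usability, cost, speed, and accuracy. A computer vision-based human-computer interaction tool is proposed for gesture recognition applications that act as a natural user interface. Virtual glove markers are created on the user's hands and used as input to a deep learning model that recognizes gestures in real time. The results obtained show that the proposed system would be effective in real-time applications, including social interaction through telepresence and rehabilitation.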
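The use of artificial intelligence in the agricultural sector is growing rapidly as a way to automate farming activities. Emerging agricultural technologies focus on the mapping and classification of plants, fruits, diseases, and soil types. Although assisted harvesting and pruning applications that use deep learning algorithms are at an early stage of development, there is demand for solutions that automate such processes. This paper proposes using deep learning to classify the trusses and runners of strawberry plants by means of semantic segmentation and dataset augmentation. The proposed approach uses noise (Gaussian, speckle, Poisson, and salt-and-pepper) to artificially augment the dataset, compensating for the small number of data samples and increasing overall classification performance. The results are evaluated using the mean of precision, recall, and F1 score. The proposed approach achieved 91%, 95%, and 92% precision, recall, and F1 score, respectively, for truss detection using ResNet101 with salt-and-pepper noise for dataset augmentation; and 83%, 53%, and 65% precision, recall, and F1 score, respectively, for truss detection using ResNet50 with Poisson noise for dataset augmentation.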
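The four noise types named in the abstract above can be sketched with NumPy as follows. This is a minimal illustration, not the authors' pipeline: it assumes images are float arrays scaled to [0, 1], and every parameter (`sigma`, `peak`, `amount`) and function name is a hypothetical choice. For semantic segmentation, the label mask would be left unchanged while only the image is perturbed.

```python
import numpy as np


def add_gaussian(img, rng, sigma=0.05):
    """Additive zero-mean Gaussian noise."""
    return np.clip(img + rng.normal(0.0, sigma, img.shape), 0.0, 1.0)


def add_speckle(img, rng, sigma=0.1):
    """Multiplicative (speckle) noise: img * (1 + n), n ~ N(0, sigma^2)."""
    return np.clip(img * (1.0 + rng.normal(0.0, sigma, img.shape)), 0.0, 1.0)


def add_poisson(img, rng, peak=255.0):
    """Poisson (shot) noise: sample photon counts around img * peak."""
    return np.clip(rng.poisson(img * peak) / peak, 0.0, 1.0)


def add_salt_pepper(img, rng, amount=0.05):
    """Set roughly `amount` of the pixels to pure black or pure white."""
    out = img.copy()
    u = rng.random(img.shape[:2])        # one draw per pixel location
    out[u < amount / 2] = 0.0            # pepper
    out[u > 1.0 - amount / 2] = 1.0      # salt
    return out


rng = np.random.default_rng(0)
image = rng.random((64, 64, 3))          # stand-in for a real RGB image
augmented = {
    "gaussian": add_gaussian(image, rng),
    "speckle": add_speckle(image, rng),
    "poisson": add_poisson(image, rng),
    "salt_pepper": add_salt_pepper(image, rng),
}
```

Each function returns a new array of the same shape, so the noisy copies can be appended to the training set alongside the original image and its mask.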
Statistical risk assessments inform consequential decisions such as pretrial release in criminal justice, and loan approvals in consumer finance. Such risk assessments make counterfactual predictions, predicting the likelihood of an outcome under a proposed decision (e.g., what would happen if we approved this loan?). A central challenge, however, is that there may have been unmeasured confounders that jointly affected past decisions and outcomes in the historical data. This paper proposes a tractable mean outcome sensitivity model that bounds the extent to which unmeasured confounders could affect outcomes on average. The mean outcome sensitivity model partially identifies the conditional likelihood of the outcome under the proposed decision, popular predictive performance metrics (e.g., accuracy, calibration, TPR, FPR), and commonly-used predictive disparities. We derive their sharp identified sets, and we then solve three tasks that are essential to deploying statistical risk assessments in high-stakes settings. First, we propose a doubly-robust learning procedure for the bounds on the conditional likelihood of the outcome under the proposed decision. Second, we translate our estimated bounds on the conditional likelihood of the outcome under the proposed decision into a robust, plug-in decision-making policy. Third, we develop doubly-robust estimators of the bounds on the predictive performance of an existing risk assessment.
Quantifying motion in 3D is important for studying the behavior of humans and other animals, but manual pose annotations are expensive and time-consuming to obtain. Self-supervised keypoint discovery is a promising strategy for estimating 3D poses without annotations. However, current keypoint discovery approaches commonly process single 2D views and do not operate in the 3D space. We propose a new method to perform self-supervised keypoint discovery in 3D from multi-view videos of behaving agents, without any keypoint or bounding box supervision in 2D or 3D. Our method uses an encoder-decoder architecture with a 3D volumetric heatmap, trained to reconstruct spatiotemporal differences across multiple views, in addition to joint length constraints on a learned 3D skeleton of the subject. In this way, we discover keypoints without requiring manual supervision in videos of humans and rats, demonstrating the potential of 3D keypoint discovery for studying behavior.
Realistic synthetic image data rendered from 3D models can be used to augment image sets and train image classification and semantic segmentation models. In this work, we explore how high quality physically-based rendering and domain randomization can efficiently create a large synthetic dataset based on production 3D CAD models of a real vehicle. We use this dataset to quantify the effectiveness of synthetic augmentation using U-net and Double-U-net models. We found that, for this domain, synthetic images were an effective technique for augmenting limited sets of real training data. We observed that models trained on purely synthetic images had a very low mean prediction IoU on real validation images. We also observed that adding even very small amounts of real images to a synthetic dataset greatly improved accuracy, and that models trained on datasets augmented with synthetic images were more accurate than those trained on real images alone. Finally, we found that in use cases that benefit from incremental training or model specialization, pretraining a base model on synthetic images provided a sizeable reduction in the training cost of transfer learning, allowing up to 90% of the model training to be front-loaded.
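The mean prediction IoU referred to in the abstract above is the intersection-over-union between predicted and ground-truth masks, averaged over images. A minimal sketch for binary masks follows. The function names are hypothetical, and treating two empty masks as a perfect match is one common convention, not something the abstract specifies.

```python
import numpy as np


def binary_iou(pred, target):
    """Intersection-over-union of two binary masks of equal shape."""
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    union = np.logical_or(pred, target).sum()
    if union == 0:
        return 1.0  # assumed convention: two empty masks agree perfectly
    return float(np.logical_and(pred, target).sum() / union)


def mean_iou(preds, targets):
    """Average per-image IoU over paired lists of masks."""
    return float(np.mean([binary_iou(p, t) for p, t in zip(preds, targets)]))


pred = [[1, 1, 0],
        [0, 1, 0]]
target = [[1, 0, 0],
          [0, 1, 1]]
print(binary_iou(pred, target))  # 2 shared pixels / 4 in the union = 0.5
```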
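Collaborative carrying is a complex task because of the continuous action and state spaces, the multimodality of strategies, the presence of obstacles in the environment, and the need for instantaneous adaptation to other agents. In this work, we present a method for predicting realistic motion plans for cooperative human-robot teams. By using a Variational Recurrent Neural Network (VRNN) to model the trajectory of a human-robot team over time, we are able to capture the distribution over the team's future states while leveraging information from the interaction history. The key to our approach is the model's ability to leverage human demonstration data and to generate trajectories that synergize well with humans at test time. We show that the model generates more human-like motion than a baseline centralized sampling-based planner, Rapidly-exploring Random Trees (RRT). Furthermore, we evaluate the VRNN planner with a human partner and show that, when planning with a human, it both generates more human-like paths and achieves a higher task success rate than RRT. Finally, we demonstrate that a LoCoBot using the VRNN planner can complete the task successfully with a human controlling another LoCoBot.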
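Falls are a leading cause of fatal and non-fatal injuries, particularly among older people. Imbalance can result from causes internal to the body (such as illness) or from external causes (such as active or passive perturbation). Active perturbation results from an external force being applied to a person, while passive perturbation results from human motion interacting with a static obstacle. This work proposes a metric that allows monitoring of the torso and its correlation with active and passive perturbations. We show that large changes in torso sway can be strongly correlated with active perturbations. We also show that, by conditioning the expected path and torso sway on the past trajectory, torso motion, and the surrounding scene, we can reasonably predict the future path and the expected change in torso sway. This could have direct applications to fall prevention. The results show that torso sway is strongly correlated with perturbations, and that our model is able to make use of the visual cues presented in the panorama and condition its prediction accordingly.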
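Collaborative robots stand to have an immense impact on human welfare in domestic service applications and on industrial superiority in advanced manufacturing, where dexterous assembly is required. The outstanding challenge is to give robotic fingertips a physical design that makes them adept at dexterous tasks requiring high-resolution, calibrated shape reconstruction and force sensing. In this work, we present DenseTact 2.0, an optical sensor capable of visualizing the deformed surface of a soft fingertip and using that image in a neural network to perform both calibrated shape reconstruction and 6-axis wrench estimation. We demonstrate a sensor accuracy of 0.3633 mm per pixel for shape reconstruction, 0.410 N for forces, and 0.387 N·mm for torques, as well as the ability to calibrate new fingers through transfer learning, achieving comparable performance with more than four times faster training and only 12% of the dataset size.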